Classifying Numeric Information
نویسنده
چکیده
Learning programs that try to generalize from real-world examples may have to deal with many different kinds of data. Continuous numeric data may cause problems for algorithms that search for identical aspects of examples. This problem can be .. . surmounted by categori=ing the nume-ric data. However, this process has problems of its own. In this paper we look at the need for categorizing numeric data, and several methods for doing so. \V e concentrate on the use of a heuristic, looking for gaps, that has been implemented in the UNIMEM computer system. An example is presented of this algorithm categorizing data about states of the United States.
منابع مشابه
A Machine Learning Approach to Speech Act Classification Using Function Words
This paper presents a novel technique for the classification of sentences as Dialogue Acts, based on structural information contained in function words. It focuses on classifying questions or non-questions as a generally useful task in agent-based systems. The proposed technique extracts salient features by replacing function words with numeric tokens and replacing each content word with a stan...
متن کاملStatistics for Categorical Surveys—A New Strategy for Multivariate Classification and Determining Variable Importance
Surveys can be a rich source of information. However, the extraction of underlying variables from the analysis of mixed categoric and numeric survey data is fraught with complications when using grouping techniques such as clustering or ordination. Here I present a new strategy to deal with classification of households into clusters, and identification of cluster membership for new households. ...
متن کاملFacial Expression Recognition Using Interpolation Features
In this work, a methodology for classifying emotions (such as happiness, anger and surprise) based on face images is proposed. This methodology consist of three stages: in the pre-processing stage, edge detectors and threshold algorithms are used in order to find edge information about ROIs; in the second stage (feature extraction) numeric information of pre-processing images is extracted via i...
متن کاملSecurity Metrology and the Monty Hall Problem
Evaluating computing systems and classifying them by the security properties they provide is not new [13, 14]. Other researchers [8, 9] have pointed out the difficulty of evaluating security and the apparent binary nature of security given discoveries of system vulnerability. Here, I compare the role of security evaluations with that of cryptographic security parameters, and relate the difficul...
متن کاملUsing Fuzzy LR Numbers in Bayesian Text Classifier for Classifying Persian Text Documents
Text Classification is an important research field in information retrieval and text mining. The main task in text classification is to assign text documents in predefined categories based on documents’ contents and labeled-training samples. Since word detection is a difficult and time consuming task in Persian language, Bayesian text classifier is an appropriate approach to deal with different...
متن کامل